Quantized biopolymer translocation through nanopores: departure from simple scaling 
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We discuss multiscale simulations of long biopolymer translocation through wide nanopores that 
can accommodate multiple polymer strands. The simulations provide clear evidence of folding 
quantization, namely, the translocation proceeds through multi-folded configurations characterized 
by a well-defined integer number of folds. As a consequence, the translocation time acquires a 
dependence on the average folding number, which results in a deviation from the single- exponent 
power-law characterizing single-file translocation through narrow pores. The mechanism of folding 
quantization allows polymers above a threshold length (approximately 1, 000 persistence lengths for 
double-stranded DNA) to exhibit cooperative behavior and as a result to translocate noticeably 
faster. 

The translocation of biopolymers through nanopores is drawing increasing attention because of its role in many 
fundamental biological processes, such as viral infection by phages, inter-bacterial DNA transduction or gene therapy 
This problem has motivated a number of in vitro experimental studies, aimed at exploring the translocation 
process through protein channels across cellular membranes 2^, |3j] , or through micro- fabricated channels [4] . Recent 
experimental work has addressed the possibility of ultra-fast DNA-sequencing using electronic identification of DNA 
bases, while tracking its motion through nanopores under the effect of a localized electric field fEI]. Experiments also 
reported that the translocation of biopolymers through pores wide enough to accommodate multiple strands, exhibits 
the intriguing phenomenon of current-blockade quantization, that is, discrete jumps of the electric current through 
the pore during the translocation process [Ij. This was interpreted as indirect evidence that the polymer crosses the 
pore in the form of discrete configurations, associated with integer values of the folding number, that is, the number 
of strands simultaneously occupying the pore during the translocation. This behavior has been recently confirmed by 
direct observation of multi- folded configurations in large-scale simulations of biopolymer translocation Q . 

In the present work, we report on the behavior of long polymers undergoing translocation through relatively 
wide pores, which exhibits qualitatively new features. Under these conditions, we predict from our simulations that 
folding quantization is actually enhanced and leads to faster translocation by effectively reducing friction through the 
pore region. The observed behavior also elicits an intriguing analogy with quantum systems, whereby the observed 
translocation time can be formulated as a weighted average over the whole set of multi- folded configurations (the "pure 
states" of the polymer-pore system). Within this picture, the translocation time acquires an additional dependence 
on the polymer length, through the average value of the folding number (q) y , which increases with the polymer length 
N. Thus, at variance with the case of narrow and short pores [^, |^ flO. Till, [l^ . translocation through wide pores 
allowing for multiple simultaneous strands is not described by a single power-law exponent. To give a specific example 
of scales involved, the size of the pores required to observe this behavior in double-stranded DNA is of the order of 
several (^ 10) times the effective cross-sectional diameter (which depends on salt concentration and repulsion between 
pairs of aligned DNA molecules), while the length threshold above which this behavior emerges is 150,000 base 
pairs (bp's). What is remarkable and counter-intuitive about this behavior is, first, a highly ordered organization 
of the multiple strands at high folding number, and second, the ability of the quantized configurations to flow 
through the wide pore without experiencing any additional drag, compared to the single-file configuration. 

Multiscale model: Our results are obtained from a multiscale treatment of translocation, involving a coarse-grained 
model for the biopolymer in which the basic unit ("bead") is equivalent to one persistence length 50 nm), and the 
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FIG. 1: (Color) Probability distribution of the folding number q for polymer lengths A'^ = 1, 000 — 8, 000 and pore width dp = 9. 
Peaks at integer q values (shown by vertical dashed lines) are evident. The inset shows data for N = 100 — 400 at low-g values. 



molecular motion is coupled to the motion of the solvent in which the biopolymer exists. Incidentally, the length of 
polymers considered here, up to 8,000 beads (equivalent to ^ 1.2 x 10^ DNA bp's), is an order of magnitude above 
any previous simulation in the field. 

The model cou ples microscopic molecular dynamics (MD) for the biopolymer bead motion to a mesoscopic lattice 
Boltzmann (LB) [l3( treatment of the solvent degrees of freedom 10]. In contrast to Brownian dynamics, the LB 
approach handles the fluid-mediated solvent-solvent interactions through an effective representation of local collisions 
between the solvent and solute molecules. The biopolymer translocates through a nanopore under the effect of a 
strong localized electric field applied across the pore ends, similar to the conditions in experimental settings with 
the entire process taking place in the fast translocation regime. 

A periodic box of size NxNyNz{Ax)^ lattice units, with Ax the spacing between lattice points, contains both 
the solvent and the polymer. All parameters are measured in units of the LB time step and spacing. At and Ax, 
respectively (both set to 1); the MD time step is 0.2. We take ^ Ny = with the wall at x ^{N.J2), N,^ = 128 
and the number of beads N in the 100 - 8,000 range. At t — the polymer resides on one side of the separating wall, 
X >{Nx/2), near the opening of a cylindrical pore of nominal length Ip — 3 and nominal diameter dp] we considered a 
narrower, dp = 5, pore and a wider one, dp — 9. Translocation is induced by a constant electric field acting along the 
X direction and confined to a cylindrical channel of the same size as the pore, and length 3 along the streamwise (x) 
direction. The pulling force associated with the electric field, E, in the experiments is QeE — 0.02 and the average 
thermal speed ksT /ra = 10~^, where is the effective charge per bead. The interaction between monomers and 
with the wall are modeled by 6-12 Lennard-Jones potentials |14| . and other aspects of the simulation are the same as 
those in our previous work 10], which successfully reproduced single-file translocation l3, l3]. The effective width 
and radius of the surrounding pore must take into account the repulsive bead-wall interactions that result in an 
effective exclusion distance of ~ 1.5 [10]. Therefore, a monomer is considered to lie inside the pore if contained 
in a pore of effective width 1'^^^ ~ 6 (due to exclusion on both sides of the separating wall) and diameter 
fjeff ^ 7 5 f-Qj. — 9 [d'^ff ~ 3.5 for dp = 5). To measure the residence number of beads in the pore region we define 
a cylinder of length hp — 10 and radius dp centered at the pore midpoint and with axis aligned with the pore. This 
extended region misses monomers close to the pore openings and in contact with the wall, but permits to measure the 
number of beads in a wider region than the pore width with better statistics (reduced variation in the resident-bead 
number). 

Configurational analysis - quantization of the folding number: In Figll] we show the cumulative statistics of the 
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FIG. 2: (Color) Three trajectories of the folding number sampled from the ensemble A'' = 4, 000 and dp = 9. Low q configurations 
are most often visited, but g > 10 can also occasionally occur. The inset shows an actual translocation event at the mid-point 
with g = 9. 



folding number, q = Nres/Ni, collected at each time-step of every single trajectory for a series of 100 realizations 
for each polymer length. Here, Nres is the number of resident beads at each given time for each realization, while 
Ni is the observed single-file value of the resident number, A^i — hp/b'^^^ = hp{l/b + l/crb)/2 ~ 7 (for values of ah 



and b see |14l | and [15l|). The combined statistics over initial conditions and time evolution, produces an aggregate 



ensemble, ranging from about 10^ time frames for the shortest history, {N = 100, dp = 9), up to over 10^ time frames 
for the longest one {N = 8,000, dp — 5). The case dp — 9 reveals a sharp quantization of the distribution of the 
folding number, with well defined peaks, which closely resembles the one observed in double-strand DNA translocation 
through solid nanopores (see Fig. 7 of [16|). For the shortest strands, N = 100, 200, 400, the peaks are shifted to 
values slightly larger than the integer values q — 1,2. This indicates that the polymer spends most of its time between 
the low-fold states q = 1,2. However, for longer strands N > 1,000, the peaks of the distribution appear almost 
perfectly centered at integer values, up to g = 4, 5, 6, respectively. This quantized spectrum is particularly evident for 
the case of the longest polymers, iV = 4, 000 and 8, 000. In this case, "quantum states" up to g = 10 are populated, 
with a slight shift of the peaks to values higher than integer values only for q > 7. Note that the quantization is 
evident up to q = 10, which is still significantly smaller than the largest folding number compatible with the pore 
diameter, qmax = {dp/cfY ~ 20 17]. The value of q above which the quantization gradually diverges from integer 
values, is an increasing function of the polymer length. The case dp = 5 (not presented) shows the same structure, 
though on a smaller range of q values, up to g = 5. 

These data suggest an intriguing analogy with quantum systems, the folding number playing the role of the quantum 
numbers associated with excited states of atomic and molecular configurations. Within this analogy, single-file translo- 
cation (g = 1) would represent the analog of the ground state of the polymer-pore system. Indeed, the long-term, final 
stage of the translocation, is always found to proceed in single-file mode, corresponding to the polymer tail. In this 
respect, long polymers translocating through wide pores are naturally expected to exhibit a richer spectrum of excita- 
tions vs. short polymers translocating through narrow pores. In particular, the number of excited states supported by 
the polymer-pore system should grow quadratically with the pore diameter. Specifically, a polymer of length L = bN 
translocating through a pore of length Ip^^ and diameter d^-^^ , can produce a spectrum of folding numbers up to a 
maximum of qmax oc {dp^-^ /ab)"^ strands (full-packing limit), each consisting of Ni = 1^^^ /b'^f^ monomers. This limit 
can only be saturated by sufficiently long polymers, such that N/Ni :s> qmax, namely N :s> Np ^ I'j/ ^ [d^^ ^ ^ / bal , 
with Np ~ 100 the saturation length in the present work. 

The statistical picture presented above is supported by the dynamic trajectories. In Fig. [21 we show the time- 
evolution of the folding number q{t) for three histories drawn at random from the pool of a hundred realizations of the 
iV = 4, 000 system. A very rich dynamics, with sudden jumps between the various "excited states" , is clearly visible. 
Interestingly, jumps occur both ways, from low to high q and vice-versa, corresponding to absorption/emission of 
"fold-quanta" . This indicates that translocation is not monotonic, but consists of a mixed sequence of folding and 
un-folding events. As anticipated, this sequence is always found to end up in single-file configuration, corresponding to 
the translocation of the polymer tail. Interestingly, the trajectories spend virtually all of their time in quantized states 
with very sharp transitions between them. A snapshot of such a quantized state for a highly folded configurations 
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FIG. 3: (Color) Translocation times as a function of the polymer length (N) for wide pore (dp — 9). Top panel shows the 
average translocation time (green diamonds with associated error bars), the most probable translocation time (purple line) and 
the range between the maximum and minimum translocation times (shaded region). Bottom panel shows a linear fit to the 
low range results 10^ < N < 10^ (dashed line), the multifile translocation time Tmf{N) from Eq. ((2)1 (red squares), and the 
average translocation time (green diamonds). 



{q — 9) is shown in the inset of Fig. [S] 

Scaling exponents: In Fig. [3l we report the translocation time as a function of the polymer length, N, for transloca- 
tion through the wide pore {dp = 9). The most probable time from the time distributions, together with the average 
and the range between the minimum and maximum times are presented. From this figure, it is apparent that up 
to a length N — 1,000, the translocation time r obeys a scaling law of the form r ~ A^", with a ~ 1.36 for the 
most-probable translocation time, slightly larger than the corresponding value for narrow pores [1, 0, [l^, 11, [l^ . 



We emphasize that, regardless of the exact values of the scaling exponents, all translocation indicators, that is, the 
minimum, maximum, most probable, and average translocation times, point clearly towards a deviation from a single- 
exponent power-law in the region > 1, 000, a clear signature of multi-fold translocation. By restricting the analysis 
to the four longest chains, N — 1,000, 2,000, 4,000, 8,000, the bending of the curve (reduced translocation time) 
might be interpreted as the emergence of a new scaling exponent, a2 ~ 0.75. However, as shown below, this bending 
is due to the multi-fold conformation of the translocating biopolymer, which does not necessarily follow a power-law 
dependence on the polymer size. 

The translocation dynamics depends on the strength of the frictional forces exerted by the wall. In the single-file 
scenario, strong friction can change the power-law exponent from ~ 1.2 to a linear relation t (x N [18j . In the 
case of multi-file translocation, a central issue is whether or not the highly folded configurations induce high-friction 
conditions. As demonstrated in Fig. |4l dN/dt is linearly correlated to g, with approximately the same slope for all 
folds. If friction were dominant, dN/dt vs. q would asymptotically reach a constant value with increasing g, which 
is clearly not observed in the simulations. Moreover, the dN/dt vs. q slope depends only slightly on the polymer 
length (data not shown), changing by ^ 30% in going from N — 400 to = 8,000, which is further evidence in 
support of hydrodynamic coherence inside the pore. It is likely that such coherence arises from small velocity 
differences between neighboring beads in the pore, by minimizing bead-bead frictional forces, and/or 
the lubricating effect of the surrounding solvent enhanced by the alignment of strands. Therefore, 
frictional forces have a negligible effect, possibly limited to a small layer close to the wall, and unimportant for the 
group of translocating monomers. This rules out the possibility that the change of exponent is caused by frictional 
forces inside the pore. 

Regardless of the underlying nature of the translocation process, we can compute the translocation time of each 
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FIG. 4: (Color) Scatter plot illustrating the correlation between the folding number q and the rate of translocating monomers 
dN/dt for the wide pore (dp = 9) and long polymer (A'^ = 8, 000) case. Colors indicate the density of points (blue; low, red: 
high); the straight line is the best linear fit. 



realization, with t the total translocation time: 



N = j —dt^KN I Nres{t)dt (1) 







where we have used the fact that dN/dt and N^es are linearly correlated with a constant of proportionality Kj^. Next, 
we write N^esit) = Q{i)Nres,i-, where the subscript 1 stands for the single-file [q — 1) limit of the residence number. 
Eq. ID) then leads to A'^ = KpfNres.iQNT, where is the time averaged value of q. By averaging over all realizations, 
we obtain: 

Nres,l{KN}{qN) {qN) 

where brackets stand for ensemble averaging, Tmf{N) is the multi-file translocation time, and we have defined ti{N) = 
[N /Nres.i){^/ {Kn))\ note that the dependence of {Km) on N is responsible for the non-linearity of ti{N). The 
simulation data show that the average {qn) remains approximately constant ~ 1.2, for TV < 1,000, and then begins 
growing, reaching ~ 2.6 for = 8, 000. Therefore, we conclude that the departure of the translocation time from a 
power-law at large N is mainly due to the increase of (gjv) with polymer length, for > 1, 000. This, in turn, results 
from the shift of the probability distribution of the translocation time towards higher q values as N is increased. For 
all lengths N considered here, the time average {(In) remains below 3, because the states (? = 1 and q = 2 continue 
to be the most populated ones, for both pore diameters dp — 5, 9. We have also checked that the high-g peaks have 
a sizeable effect only on moments {qP) for p > 5. Since the average translocation time is only a first order moment, 
the quantized peaks have little effect on it. This explains why the two pores, dp — 5, 9, show similar dependence 
of {(In) on TV. As a self-consistency check, in Fig. |3]we show the average translocation time for the case dp = 9, 
and compare it to the single-exponent estimate ti{N) ~ _/Vi-3i and the compensated multi-file translocation time, 
Tmf = Ti{N)/{q)N- The reasonable match of t,„^ with the data supports the idea that the speed-up of the longest 
chains can be attributed to the spectral shift of the folding number q. 

Finally, we note that analyzing the in-pore conformation vs. q is an important aspect of multi-file 
translocation. In the present communication we focused on how translocation, and the accompanying 
single-file out-of-pore hydrodynamics, is modulated by the population of the in-pore states, but have 
not analyzed in detail the in-pore conformations, which will be addressed in future work. 
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